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ABSTRACT 

Theoretical and observational arguments suggest that the intergalactic medium (IGM) might have 
been polluted with metals produced by early star formation. In this scenario, Lyman-break galaxies 
(LBGs) at redshift z = 3 are likely to be surrounded by old metal bubbles, relics of an era when the 
characteristic mass of galaxies was small and gas retainment more difficult. We find that pregalactic 
enrichment of the IGM from 10 8 — 10 10 M Q dwarf galaxies at 6 < z < 12 can quantitatively explain the 
high cross-correlation between C iv systems and LBGs observed at z = 3. The reason is twofold. First, 
both LBGs and high-z dwarfs are biased tracer of the mass distribution and form from exceptionally 
high density fluctuations which are strongly clustered. Second, the action of gravity tends to increase 
the spatial association between metal bubbles and LBGs. Our analysis shows that, in order to match 
the abundance of C iv systems observed at z = 3, the metal bubbles generated by high-z dwarfs must 
have comoving sizes of ~ 100 kpc. We conclude that the observed galaxy-C iv spatial association 
needs not to be generated by late "superwinds" from LBGs. 

Subject headings: cosmology: theory - galaxies: high-redshift - intergalactic medium - quasar: ab- 
sorption lines 



1. INTRODUCTION 

Quasar absorption lines constitute a fossil record that 
holds the clues to the exchange of mass, metals, and en- 
ergy between active galactic nuclei, protogalaxies, and 
the intergalactic medium (IGM). The distribution of 
metals in the IGM is highly inhomogeneous, with a global 
cosmic abundance at z = 3 of [C/H]= — 2.8±0.13 for gas 
with overdensities 0.3 < S < 100 (Schaye et al. 2003). 
The epoch at which this cosmic enrichment occurred is 
uncertain, and so is the nature of the galaxies responsible 
for seeding the IGM with nuclear waste. The extent of 
metal pollution seem to require galaxy-scale outflows to 
overcome the gravitational potential of their host halos, 
and enriched material to be ejected far away from the 
density peaks where large galaxies form, gas cools, and 
star formation takes place. 

Observationally, C iv absorbers and Lyman-break 
galaxies (LBGs) at z as 3 are found to be spatially cor- 
related (Adelberger et al. 2003, hereafter ASSP). The 
mean number of LBGs within a comoving distance r 
from a detectable C iv system (New > 10 115 cm~ 2 ) 
equals one for r — ri = 2.4 ft -1 Mpc. This is much 
smaller than the value expected for a random distribu- 
tion (n ~ 3.9 hr 1 Mpc, ASSP). Stronger C iv systems 
seem to be even more closely associated with LBGs 
(ri < 0.5 h- 1 Mpc for A C iv k, 10 13 5 cm- 2 ). At the 
same time, the Lya absorption produced by the IGM 
within ~ 0.5 ft -1 Mpc from an LBG appears to be sys- 
tematically reduced with respect to random locations. 
Moreover, high-resolution spectra reveal that LBGs have 
high star formation rates and strong winds with outflow 
velocities exceeding several hundred kilometers per sec- 
ond. All these facts led ASSP to argue that metal-rich 
"superwinds" from LBGs are responsible for distributing 
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the product of stellar nucleosynthesis on Mpc scales. 

In this Letter we argue that this interpretation may 
be an oversimplication of a far more complex physical 
picture. The starting point of our investigation is the 
fact that LBGs are highly biased tracers of the under- 
lying mass-density field. The observed clustering am- 
plitude suggests that LBGs reside in dark-matter halos 
with mass M > 10 U5±0 - 3 M at z = 2.9 (Porciani 
& Giavalisco 2002; Adelberger et al. 2005). In hier- 
archical cosmogonies, these halos are expected to form 
within overdense regions (extending for a few Mpc) which 
are likely to be pre-enriched by subgalactic objects at 
much higher redshifts. The ejection of supernova debris 
from the shallow potential wells of collapsed, rare den- 
sity fluctuations is, in fact, expected to lead to an early 
era of metal pollution and preheating in biased regions 
of the IGM (e.g. Tegmark et al. 1993; Madau et al. 
2001; Scannapieco et al. 2002; Thacker et al. 2002). We 
show that pregalactic enrichment from 10 s — 10 10 M Q 
dwarf galaxies at 6 < z < 12 can quantitatively ex- 
plain both the observed close spatial coincidence be- 
tween C iv systems and LBGs and the abundance of 
C iv absorbers at z = 3. Throughout this paper we will 
adopt a flat ACDM background cosmology with parame- 
ters (fi A , n M , Q b , n, ct 8 , ft) = (0.3, 0.7, 0.05, 1, 0.87, 0.65). 

2. BASIC THEORY 

The properties of the dark-matter halo population as a 
function of redshift can be approximately computed with 
the excursion-set formalism (Bond et al. 1991). In this 
approach, the halo mass function n(M, z) is determined 
by following the random walk of the linear density con- 
trast, (5, as a function of spatial resolution (parameterized 
by the mass variance, S) and measuring the rate with 
which different trajectories first upcross a given thresh- 
old value, t c = 5 c (z) / 'D(z). Here S c is the linear over- 
density of a top- hat perturbation at collapse (5 C = 1.686 
in a Einstein-de Sitter universe) and D(z) is the linear 
growth factor of density perturbations, normalized such 
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that D{z = 0) = 1. 

Porciani et al. (1998) have shown that the excursion-set 
formalism can be also used to derive the clustering prop- 
erties of dark-matter halos in the Lagrangian coordinate 
system (which gives the positions of proto-halo centers in 
the linear density field, formally at z — > oo). This is ob- 
tained accounting for the spatial coherence of the linear 
density field and simultaneously following a number of 
correlated random walks associated with different spa- 
tial locations. More recently, Scannapieco & Barkana 
(2002) generalized these results to halos collapsing at 
different epochs. Here we use their analytical approxi- 
mation to compute the Lagrangian correlation function 
^h(rh\Mi, zi, M 2 , z 2 ), between halos of mass Mi forming 
at z\ and halos of mass M% forming at z 2 > z\. Hereafter, 
Mi > 10 115 Mq will denote the mass of the host-halo of 
an LBG forming at z\ = 3 while the free-parameters M 2 
and Z2 will characterize subgalactic halos collapsing at 
earlier epochs. 

In order to compare theory and observations, one 
needs to compute the halo clustering properties in the 
Eulcrian coordinate system (in which galaxies are ob- 
served) . The mapping of halo positions from Lagrangian 
to Eulerian space depends on the gravity driven dy- 
namics of the background mass density field (Catelan 
et al. 1998). Consider the evolution of a spherically- 
symmetric shell of Lagrangian radius tl centered onto a 
halo of mass Mi 3> M 2 . Its Eulerian position at red- 
shift Z\ only depends on the mean overdensity A(zi) 
within it: r E = ?t/[1 + A(zi)] 1 / 3 (for the sheared case 
see Catelan et al. 1998). Using the spherical top-hat 
model it is possible to relate the non-linear density con- 
trast A to its linearly extrapolated counterpart 5, as 
5 ~ 5 C [1 - (1 + A)- 1 /5c]/ £) ( z ) (p av iidou & Fields 2004). 
This implies r E = f(S,r L ) = r L [l - (D(z) 5/S c )] 5 ^ 3 . 
What is the probability distribution of the linear over- 
density within a shell of given Lagrangian size centered 
onto an halo of mass Mi? This can be easily computed 
using the excursion-set model. The conditional probabil- 
ity that a trajectory assumes the value 6 when smoothed 
on the scale tl (corresponding to the mass variance s), 
given that it will first upcross the threshold t c (zi) at the 
mass variance S (corresponding to the mass Mi), is 
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denotes the probability distribution of first upcrossings. 
The conditional probability distribution of r E given a 
Lagrangian separation can then be written as 



V{r E \n) - 



to 



V(S\r L )S D [r E -f(S,r L )}d5 
3V{S\r L ) 



r L D( Zl )\l-S/t c \^m-^ 



(4) 



2 
1.5 
I 1 
0.5 



centered 
on a LBG 



L =l'.5' Mpfe 




r L =3 Mpc 



-2-10 1 2 

<5 (linearly extrapolated to z = 3) 



£ 4 

Oh 

2 





Fig. 1. — Top: Probability distribution at z = 3 of the linear 
overdensity in a sphere centered on a dark matter halo with mass 
Ml = 10 lf - 5 M Q (solid; see eq. and on a random point ( dashed) . 
The three sets of curves correspond to spheres with Lagrangian 
radii of 1.5, 2 and 3 comoving Mpc. The Lagrangian radius of the 
central halo is 1.3 Mpc. Bottom: Probability distribution at z = 3 
of the Eulerian separation between a dwarf galaxy (of any mass 
M2 and collapse redshift Z2) and an LBG (see eq. |lj for a given 
Lagrangian separation (indicated by labels and dashed lines). 



where 5 = 5 C [l-(r E /r L )^ s "}/D(z 1 ), T(S\r L ) denotes the 
distribution given in equation Q), and 5d(x) is the Dirac 
delta function. The mean number of objects (per unit 
length) with mass (M 2 ,M 2 + dM 2 ) located at distance 
te from a halo of mass Mi is then 



dN 2 
drE 



= 4n V(r E \r L )n(M 2 ,z 2 )[l+^(r L )]rldr L , (5) 

with i?i = (SMi/Attp) 1 ' 3 the Lagrangian radius of the 
halo of mass Mi and p the mean comoving density of 
the universe. For a population of halos with an extended 
range of masses at zi, this becomes 



dN 2 
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Mi 



Jn{M 1 ,z 1 )^dM 1 
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J n(M 1 ,z 1 )dM 1 
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Note that, by redshift z\, halos with mass M 2 and 
tl < i?i have merged with the "central" object of 
mass Mi. The mass function of the remaining halos 
is 7V(M 2 ,zi) = n{M 2 ,z 2 ) - J n(M 1 , zi)N 12 dM 1 , where 

iVi 2 = 47rn(M 2 ,z 2 ) J* 1 [1 + ClK)] r£ dr L indicates the 
mean number of M 2 halos which ended up within a sin- 
gle Mi halo (this is in good agreement with Lacey & 
Cole 1993). From the definition of the Eulerian cross- 
correlation, we then obtain 
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3. RESULTS 

We can now turn to the main point of this Letter. 
Can the observed correlation between C iv absorbers and 
LBGs at z w 3 be explained by SN-driven outflows from 
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Fig. 2. — Left: Projected cross-correlation function between LBGs and C IV systems. Points with errorbars denote the observational 
data by ASSP, while the solid line shows our best-fitting model (which corresponds to Xmin = 3-6). The shaded area indicates separations 
smaller than the Eulerian radius of a 10 11,5 Mq halo, where our model does not hold. The inset shows the small-scale modifications due 
to a finite bubble size. Right: Contour levels of the \ 2 function obtained by fitting the data by ASSP with the model described in the 



text. Solid contour: Ax X — xf 



2.3 and 6.17 (respectively marking the 68.3 and 95.4 per cent confidence levels for two Gaussian 



variables). Long-dashed curve: set of points where D(z2) 1,15 S(M2) = 2.9, which describes the location of the degenerate minimum. Light, 
short-dashed lines: set of points where the number density of C IV systems matches the observed value for different values of rb [ranging 
from 50 to 250 comoving kpc (from bottom to top) and spaced by 50 kpc] . The shaded area marks the region where virialized gas cannot 
cool via excitation of hydrogen Lyct. 



low mass systems at much higher redshiftsl Figure 1 
shows that LBGs form within overdense regions extend- 
ing for a few comoving Mpc. Assume that these biased 
regions have been already polluted by protogalaxies, i.e. 
that C iv systems are associated with metal "bubbles" 
that have been expelled from halos of mass M2 at some 
epoch between and z\? Most of the bubbles lying 
in the vicinity of the formation site of an LBG will be 
driven by gravity towards the forming galaxy thus pro- 
ducing the observed spatial association (see bottom panel 
in Fig. 1). 

In this scenario, the projected cross-correlation func- 
tion between LBGs and the bubble centers is: 



B(r±) 
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where r± denotes the component of the Eulerian separa- 
tion te in the plane of the sky. For simplicity, we assume 
that each halo of mass M2 gives origin to a spherical 
C iv bubble of comoving radius rb- The cross-correlation 
function between LBGs and C iv systems is then ob- 
tained by convolving the correlation function in equation 
(JSJ with a 2-dimensional top-hat window function (see 
the Appendix in Porciani & Giavalisco 2002). In Figure 
2, we show that the free parameters of our model can 
be tuned to reproduce the correlation function measured 
by ASSP. Here we consider a unit logarithmic interval 

3 Note that, at redshift z\, most of the halos of mass M2 have 
lost their identity by merging into larger halos (of mass < Mi ) . We 
do not follow the details of this process and assume that all the 
C IV bubbles preserve their identity. Since the bubble size is small 
with respect to the spatial separations achievable by observations, 
this assumption does not alter our conclusions. 



in mass for Mi > 1O 115 M0. The best-fitting solution is 
not unique: models with values of (-M2, z-i ) linked by the 
equation D{z-x) ^ S(M%) — 2.9 reproduce the data with 
equal accuracy (right panel in Fig. 2). This corresponds 

to perturbations with v = t c /\/~S ~ D(z2)~ ' 425 . In the 
redshift interval 6 < z^ < 12, v ranges from 2 to 2.7, i.e. 
with increasing zi rarer and rarer density fluctuations 
are required to pollute the surroundings of LBGs in or- 
der to match the observed H. Note that massive LBGs 
at z = 3 corresponds to v = 2 peaks, i.e. the dwarf 
galaxies that are the source of C iv absorption around 
LBGs must have been equally or more biased tracers of 
the mass distribution than the LBGs themselves. These 
high-^ halos form first and, most likely, inhibit star for- 
mation (and thus the production of new metal bubbles) 
in lower density peaks which collapse at a more recent 
epoch (Thacker et al. 2002). 

Figure 2 also shows that the clustering data by ASSP 
cannot constrain r^. Winds from star-forming low-mass 
systems at z ~ 10 are expected to sweep up regions of 
the IGM of comoving size r b < 250 kpc (Mori et al. 2002; 
Furlanetto & Loeb 2003) while S has been measured for 
separations r > 0.3 h^ 1 Mpc where the bubble size leaves 
no imprint. We thus constrain rb by comparing the ob- 
served mean number of C iv absorbers per unit redshift 
with the expectations from our model. From Songaila 
(2001), we infer that dN/dz ~ 8 for C iv systems with 
Aery > 10 13 cm -2 at z ~ 3.2. This corresponds to a 
bubble size of the order of 100 comoving kpc and im- 
plies that rb has to increase when one moves towards 
lower redshifts along the (Ma, 22) degeneracy line (see 
Fig. 2). Thus enriched outflows of size rb = 100 kpc 
around 10 9 M Q halos at z = 9 (or of size rb = 200 kpc 
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around 10 10 M Q halos at z = 6) might explain both the 
mean absorption-line density and the LBG-C iv cross- 
correlation observed at redshift 3. 

4. DISCUSSION 

In hierarchical cosmologies, LBGs represent only the 
tail end of a larger population of pregalactic starbursts 
that formed at much higher redshifts. The metal pollu- 
tion of the IGM may then have started at early times, 
perhaps by the same sources responsible for hydro- 
gen reionization. In "pregalactic enrichment" scenar- 
ios, LBGs are expected to form within biased regions 
of the IGM (as indicated by their strong clustering), 
which are the sites of previous star formation and enrich- 
ment. In this Letter we showed that the LBG-C iv cross- 
correlation function measured by ASSP can be explained 
by outflows from dwarf galaxies (10 s < M < 10 10 M Q ) 
collapsing at 6 < z < 12. These originate from density 
peaks with 2 < v < 2.7 (equally or more biased than 
the LBGs themselves) and are gravitationally pulled to- 
wards the LBG formation sites. Note that, assuming 
that LBGs are hosted by halos with M > 10 115 M Q , our 
model automatically reproduces also the observed cor- 
relation function of LBGs (Porciani & Giavalisco 2002; 
Adelberger et al. 2005). Contrary to previous beliefs, 
our results then indicate that the observed coincidence 
between LBGs and C iv systems does not necessarily im- 
ply that the metals observed in the IGM at z ~ 3 were 
driven out of massive LBGs by a superwind. 

Typical LBGs seem to generate gas outflows with ve- 
locities in the range < v < 600 kms -1 (ASSP). Ma- 
terial moving at a constant speed of 600 kms -1 for 300 
Myr can only cross 0.7 comoving Mpc at z — 3. Real- 
istic SN-driven winds will slow down as they physically 
displace baryons in the IGM, and can propagate out to 
r < 1 Mpc only if they are able to tap 100% of the avail- 
able SN energy (e.g. Croft et al. 2002). The association 
between LBGs and the weakest metal line absorbers can- 
not then be readily explained by superwinds. Moreover, 
while the distribution of heavy elements in the Lya forest 
shows a positive gradient with overdensity, there is lit- 
tle evolution of the total metal content of the IGM over 
the redshift range z =1.5-5 (Schaye et al. 2003; Pettini 
et al. 2003; Songaila 2001), as expected if most of the 
intergalactic metals were already in place at the high- 
est currently observable redshifts. All this provides sup- 
porting evidence for our proposed scenario where SN- 
driven winds from a biased population of subgalactic ha- 
los at very high redshift pollute the regions surround- 



ing LBGs with metals. This does not require the winds 
to propagate to large distances. Comoving metal bub- 
bles of ~ 100 kpc expelled by halos of M 2 = 10 9 M© at 
z = 9 are sufficient to explain the observed abundance of 
C iv systems at z ~ 3. 

It is fair to point out at this stage that the initial 
epochs of the galaxy formation process are currently 
only poorly understood. N-body+hydrodynamical sim- 
ulations of CDM cosmogonies have convincingly shown 
that the IGM is expected to fragment into bound struc- 
tures at early times, but are much less able to predict the 
efficiency with which gravitationally collapsed objects lit 
up, reionized, and enriched the universe at the end of the 
"dark ages" . Still, from the arguments presented above, 
it would seem unwise to use the spatial association be- 
tween C iv absorbers and LBGs and draw far-reaching 
conclusions on the extent of galactic superwinds and the 
direct impact of LBGs on the Lya forest. Uncovering 
the relative contributions of LBGs and young dwarfs to 
the pollution of the IGM may prove difficult. Metals 
ejected during an early episode of star formation may 
be seen in absorption against bright background sources 
such as rare quasars or gamma-ray bursts at z ~ 10 
(Furlanetto & Loeb 2003). This, however, requires high- 
resolution spectroscopy of faint sources in the near-IR 
and may have to wait for the James Webb Space Tele- 
scope. On the other hand, old wind bubbles may have dif- 
ferent abundance patterns than young winds from LBGs. 
As recently pointed out by Cen et al. (2004), the gaseous 
relics of an era when the characteristic mass of galaxies 
was small, gas retainmcnt more difficult, and metal recy- 
cling more limited, are expected to contain lower ratios 
of secondary (e.g. Nitrogen) to primary (e.g. Carbon, 
Oxygen) metals. For instance, theoretical yields from 
pair-instability SNe originating from massive Population 
III stars are characterized by large Si/C ratios (Heger & 
Woosley 2002). Tests relying on measurements of relative 
metal abundances at high-z, however, are made imprac- 
tical by the uncertainties in the ionization corrections 
and the lack of information regarding the stellar initial 
mass function. 
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